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Abstract 

Integer partitions may be encoded as either ascending or descending 
compositions for the purposes of systematic generation. Many algorithms 
exist to generate all descending compositions, yet none have previously 
been published to generate all ascending compositions. We develop three 
new algorithms to generate all ascending compositions and compare these 
with descending composition generators from the literature. We analyse 
the new algorithms and provide new and more precise analyses for the 
descending composition generators. In each case, the ascending composi- 
tion generation algorithm is substantially more efficient than its descend- 
ing composition counterpart. We develop a new formula for the partition 
function p{n) as part of our analysis of the lexicographic succession rule 
for ascending compositions. 

1 Introduction 

A partition of a positive integer n is an unordered collection of positive inte- 
gers whose sum is n. Partitions have been the subject of extensive study for 
many years and the theory of partitions is a large and diverse body of knowl- 
edge. Partitions are a fundamental mathematical concept and have connections 
with number theory [4], elliptic modular functions [48, p. 224], Schur algebras 
and representation theory [33, p. 13], derivatives [61], symmetric groups [10, 7], 
Gaussian polynomials [6, ch.7] and much else [2]. The theory of partitions also 
has many and varied applications [12, 41, 30, 59, 18, 1, 57]. 

Combinatorial generation algorithms allow us to systematically traverse all 
possibilities in some combinatorial universe, and have been the subject of sus- 
tained interest for many years [28]. Many algorithms are known to generate 
fundamental combinatorial objects; for example, in 1977 Sedgewick reviewed 
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more than thirty permutation generation algorithms [49] . Many different orders 
have been proposed for generating combinatorial objects, the most common be- 
ing lexicographic [23] and minimal- change order [46]. The choice of encoding, 
the representation of the objects we are interested in as simpler structures, is of 
critical importance to the efficiency of combinatorial generation. 

In this paper we demonstrate that by changing the encoding for partitions 
from descending compositions to ascending compositions we obtain significantly 
more efficient generation algorithms. We develop three new algorithms under 
the most common generation idioms: recursion (Section 2), succession rules 
(Section 3). and efficient sequential generation (Section 4). In each case we 
rigorously analyse the new algorithm and use this analysis to compare with a 
commensurable algorithm from the literature, for which we provide a new and 
more precise analysis. These analyses are performed using a novel application of 
Kemp's abstraction of counting read and write operations [23] and this approach 
is validated in an empirical study (Section 4.3). In all three cases the new 
ascending composition generation algorithm is substantially more efficient than 
the algorithm from the literature. As part of our study of partition generation 
algorithms we provide a new proof of a partition identity in Section 4.1.1. We 
also develop a new formula for the partition function p{n), one of the most 
important functions in the theory of partitions [5], in Section 3.4. 

1.1 Related Work 

A composition of a positive integer n is an expression of n as an ordered sum 
of positive integers [52, p. 14], and a composition ai + • • • + Ofc = n can be 
represented by the sequence oi . . . . Since there is a unique way of expressing 
each partition of n as composition of n in either ascending or descending order ^, 
we can generate either the set of ascending or descending compositions of n 
in order to obtain the set of partitions. More precisely, we can say that we 
are encoding partitions as cither ascending or descending compositions for the 
purposes of systematic generation. 

Although partitions are fundamentally unordered they have come to be de- 
fined in more and more concrete terms as descending compositions. This trend 
can be clearly seen in the works of Sylvester [56], MacMahon [32, Vol.11 p. 91] 
and finally Andrews [4, p.l]. Sylvester's "constructive theory of partitions", 
based on the idea of treating a partition as "definite thing" [56] (in contrast to 
Euler's algebraical identities [20]), has been extremely successful [39]. As a re- 
sult of this, partitions are now often defined as descending compositions [4, p.l]; 
thus, algorithms to generate all partitions have naturally followed the prevailing 
definition and generated descending compositions. 

It is widely accepted that the most efficient means of generating descending 
compositions is in reverse lexicographic order: see Andrews [4, p. 230] , Knuth [27, 
p.l], Nijenhuis & Will [36, p.65-68]. Page & Wilson [38, §5.5], Skiena [50, p.52], 

^For our purposes the terms 'ascending' and 'descending' are synonymous with 'nonde- 
creasing' and 'nonincreasing', respectively. 
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Stanton & White [53, p.l3], Wells [60, p.l50] or Zoghbi & Stojmcnovic [62]. 
Several different representations (concrete data structures) have been used for 
generating descending compositions: namely the sequence [34], multiplicity [36, 
ch.9] and part-count [55] representations. Although the lexicographic succession 
rules for descending compositions in the multiplicity or part-count representa- 
tions can be implemented looplessly [14], they tend to be less efficient that their 
sequence representation counterparts [27, ex.5]. In an empirical analysis, Zoghbi 
& Stojmenovic [62] demonstrated that their sequence representation algorithms 
are significantly more efficient than all known multiplicity and part-count rep- 
resentation algorithms. 

Algorithms to generate descending compositions in lexicographic order have 
also been published. See Knuth [25, p. 147] and Zoghbi & Stojmenovic [62] 
for implementations using the sequence representation; Rcingold, Nievcrgclt 
& Deo [42, p. 193] and Fenner & Loizou [16] for implementations using the 
multiplicity representation; and Klimko [24] for an implementation using the 
part-count representation. Fenner & Loizou's tree construction operations [15] 
can be used to generate descending compositions in several other orders. 

Several algorithms are known to generate descending fc-compositions in lex- 
icographic [19, 58], reverse lexicographic [43], and minimal-change [45] order. 
Hindenburg's eighteenth century algorithm [13, p. 106] generates ascending k- 
compositions in lexicographic order and is regarded as the canonical method to 
generate partitions into a fixed number of parts: see Knuth [27, p. 2], Andrews [4, 
p. 232] or Reingold, Nievergelt & Deo [42, p. 191]. Algorithms due to Stock- 
mal [54], Lehmer [31, p. 26], Narayana, Mathsen & Sarangi [35], and Boyer [9] 
also generate ascending fc-compositions in lexicographic order. Algorithms to 
generate all ascending compositions, however, have not been considered. 

1.2 Notation 

In general we use the notation and conventions advocated by Knuth [26, p.l], 
using the term visit to refer to the process of making a complete object available 
to some consuming procedure. Thus, any combinatorial generation algorithm 
must visit every element in the combinatorial universe in question exactly once. 
In discussing the efficiency of combinatorial generation, we say that an algorithm 
is constant amortised time [44, §1.7] if the average amount of time required to 
generate an object is bounded, from above, by some constant. 

Ordinarily, we denote a sequence of integers as ai . . .afe, which denotes a 
sequence of k integers indexed ai, a2, etc. When referring to short specific 
sequences it is convenient to enclose each element using ( and ). Thus, if we 
let at. . .Ok = (3) (23), we have /c = 2, oi = 3 and a2 = 23. We will also use 
the idea of prepending a particular value to the head of a sequence: thus, the 
notation 3 • (23) is the same sequence as given in the preceding example. 

Definition 1.1. A sequence of positive integers a\. . .ak is an ascending com- 
position of the positive integer n if a\-\ + afe = n and ai < • • • < . 
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Definition 1.2. Let A{n) be the set of all ascending compositions of n for 
some n> 1, and let A{n,m) C A{n) he defined for 1 < m < n as A{n,m) = 
{ai...ak I ai...ak € A{n) and ai > m}. Also, let A{n) = \A{n)\ and 
A{n, m) = \A{n, m)|. 

Definition 1.3. A sequence of positive integers d\. . .dk is a descending com- 
position of the positive integer n if di + ■ ■ ■ + dk = n, and di > ■ ■ ■ > dk- 

Definition 1.4. Let 'D{n) he the set of all descending compositions of n for 
some n > 1, and let V* (n, m) C V{n) he defined for 1 < m < n as V* (n, m) = 
{di . . .dk \ di . . .dk & T^{n) and d\ = m}. Also, let D{n) = \'D{n)\ and D*{n) = 
\V*{n,m)\. 

There is an asymmetry between the function used to enumerate the ascend- 
ing compositions and the descending compositions: A{n, m) counts the ascend- 
ing compositions of n where the first part is at least m, whereas D* (n, m) counts 
the number of descending compositions of n where the first part is exactly m. 
This asymmetry is necessary as we require A{n, m) in our analysis of ascending 
composition generation algorithms and D* {n, m) is essential for the analysis of 
the recursive descending composition generation algorithm of Section 2.2. 

2 Recursive Algorithms 

In this section we examine recursive algorithms to generate ascending and de- 
scending compositions. Recursion is a popular technique in combinatorial gener- 
ation as it leads to elegant and concise generation procedures [44]. In Section 2.1 
we develop and analyse a simple constant amortised time recursive algorithm 
to generate all ascending compositions of n. Then, in Section 2.2 we study 
Ruskey's descending composition generator [44, §4.8], and provide a new analy- 
sis of this algorithm. We compare these algorithms in Section 2.3 in terms of the 
total number of recursive invocations required to generate all p{n) partitions of 
n. 

2.1 Ascending Compositions 

The only recursive algorithm to generate all ascending compositions available 
in the literature is de Moivre's method [11]. In de Moivre's method we generate 
the ascending compositions of n by prepending m to the (previously listed) 
ascending compositions of n — m, for to = 1, . . . , n [28, p. 20] . Our new recursive 
algorithm to generate all ascending compositions of n operates on a similar 
principle, but does not require us to have large sets of partitions in memory. 

We first note that we can generate all ascending compositions of n, with 
smallest part at least m, by prepending m to all ascending compositions of 
n — TO. We then observe that m can range from 1 to [n/2j, since the smallest 
part in a partition of n (with more than one part) cannot be less than 1 or greater 
than [n/2j ; and we complete the process by visiting the singleton composition 
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Algorithm 2.1 REcAsc(n, m, fc) 
Require: 1 < m < n 

1: X <— m 

2: while 2a; < n do 

3: ak <— X 

4: REcAsc(n - x,a;, fc + 1) 

5; X <— a; + 1 

6: end while 

7: Uk n 

8: visit ai . . . ttfe 



(n) . This provides sufficient information for us to derive a recursive generation 
procedure, Algorithm 2.f , in the idiom of Page & Wilson [38]. This algorithm 
generates all ascending compositions of n where the first part is at least m in 
lexicographic order. See Kelleher [22, §5.2.1] for a complete discussion and proof 
of correctness of Algorithm 2.1. 

Following the standard practise for the analysis of recursive generation al- 
gorithms, we count the number of recursive calls required to generate the set 
of combinatorial objects in question (e.g. Sawada [47]). By counting the total 
number of recursive invocations required, we obtain a bound on the total time 
required, as each invocation, discounting the time spent in recursive calls, re- 
quires constant time. To establish that Algorithm 2.1 generates the set A{n) 
in constant amortised time we must count the total number of invocations, 
lA2.i{n), and show that this value is proportional to p{n). 

Theorem 2.1. For all positive integers n, lA2.i{n) =p{n). 

Proof. Each invocation of Algorithm 2.1 visits exactly one composition (line 8). 

The invocation REcAsc(n, m, 1) correctly visits allp(n) ascending compositions 
of n [22, p. 78] and it immediately follows, therefore, that there must be p{n) 
invocations. Hence, lA2.i{n) =p{n). □ 

Theorem 2.1 gives us an asymptotic measure of the total computational 
effort required to generate all partitions of n using Algorithm 2.1. It is also 
useful to know the average amount of effort that this total implies per partition. 
Therefore, we let lA2.i{n) denote the average number of invocations of RecAsc 
required to generate an ascending composition of n. We then trivially get 

lA2.i{n) = 1 (1) 

from Theorem 2.1, and we can see that Algorithm 2.1 is obviously constant 
amortised time. 

In this subsection we have developed a new algorithm to generate all as- 
cending compositions of n. This algorithm, although concise and simple, can 
be easily shown to be constant amortised time. In the next subsection we ex- 
amine the most efficient known algorithm to generate descending compositions. 
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Algorithm 2.2 RECDESC(n, m, k) 
Require: 1 < m < n and dj = 1 for j > fc 
1: dk m 

2: if n = m or m = 1 then 

3: visit di... dk+n-m 

4: else 

5: for ^ 1 to min(TO, n — m) do 
6; REcDESC(n - m,a;, A; + 1) 
7: end for 
8; dfe ^ 1 

9: end if 



which we subsequently compare to the ascending composition generator of this 
subsection. 

2.2 Descending Compositions 

Two recursive algorithms arc available to generate all descending composi- 
tions of n: Page & Wilson's [38, §5.5] generator (variants of which have ap- 
peared in several texts, including Kreher & Stinson [29, p. 68], Skiena [50, p. 51] 
and Pemmaraju & Skiena [40, p. 136]) and Ruskey's improvement thereof [44, 
§4.8]. Ruskey's algorithm, given in Algorithm 2.2, generates all descending 
compositions of n in which the first (and largest) part is exactly m; thus 
RecDesc(8,4, 1) visits the compositions 41111,4211,422,431,44. RecDesc 
uses what Ruskey refers to as a 'path elimination technique' [44, §4.3] to attain 
constant amortised time performance. 

A slight complication arises when we wish to use RecDesc to generate all 
descending compositions. As the algorithm generates all descending composi- 
tions where the first part is exactly m, wc must iterate through all j G {1, . . . , n} 
and invoke RECDESC(n, j, 1). Following Ruskey's recommendations [44, §4.3], 
we consider instead the invocation REcDESC(2n, n, 1). This invocation will 
generate all descending compositions of 2n where the first part is exactly n; 
therefore the remaining parts will be a descending composition of n. Thus, if 
we alter line 3 to ignore the first part in d (i.e. visit ^2 • • • dk+n-m), we will visit 
all descending compositions of n in lexicographic order. 

Ruskey's algorithm generates descending compositions where the largest part 
is exactly m, and so we require a recurrence relation to count objects of this 
type. Ruskey [44, §4.8] provides a recurrence relation to compute D*{n, m.), 
which we shall use for our analysis. Thus, we define D*{n,n) = D*{n, 1) = 1, 
and in general, 



Recurrence (2) is useful here because it is the recurrence relation upon which 
RecDesc is based. Using this recurrence we can then easily count the number 




(2) 
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of invocations of RecDesc required to generate the deseending eompositions of 
n. Let us define I'a2.2{^'''^) the number of invocation of RecDesc required 
to generate all descending compositions of n where the first part is exactly m. 
Then, I'^^^{n,n) = /^2.2(". 1) = 1' and 

min(m,n— m) 

lA2.2{n,m) = 1+ ^ lA2.2{n-m,x). (3) 

Recurrence (3) computes the number of invocations of Algorithm 2.2 required 
to generate all descending compositions of n with first part exactly m, but tells 
us little about the actual magnitude of this value. As a step towards solving 
this recurrence in terms of the partition function p(n) we require the following 
lemma, in which we relate the /^2.2('^'"^) numbers to the D*{n,m) numbers. 

Lemma 2.1. Ifl<m<n then I'a2.2{''^-> — + D*{n — 1, m). 

Proof. Proceed by strong induction on n. 

Base case: n = 2 Suppose 1 < m < 2; it follows immediately that m = 
2. Thus, by recurrence (3) we compute 7^2 2 (^'2) ~ ^ and by recurrence (2) 
compute D*{2,2) = 1 and £>*(!, 2) = 0. Therefore, /;i2.2(2,2) = r>*(2,2) + 
D*(l,2), and so the inductive basis holds. 

Induction step Suppose, for some positive integer n, /^2.2 C*^' > ™') = D*{n',m')+ 

D*{n' — 1,to') for all positive integers \ < m! < n' < n. Then, suppose m is an 
arbitrary positive integer such that 1 < m < n. Now, suppose m = n. By (3) 
we know that 7^2. 2('^'"^) = 1 since m = m. Also, D*{n,m) = 1 as m = n, and 
D*{n — 1, m) = as n — 1 m, m ^ I and min(m, n — m — 1) = —1, ensuring 
that the sum in (2) is empty. Therefore, 7^2.2(^5 = ^* (^i rnj + D* (n — 1, m). 

Suppose, on the other hand, that 1 < m < n. We can see immediately that 
min(?77,, 71 — m) > 1, and so there must be at least one term in the sum of (3). 
Extracting this first term where x = 1 from (3) we get 

min(m,n— m) 

lA2.2in,m) = l+lA2.2{n-'m,l)+ ^ 1^2.2 - '^i, x), 

x=2 

and furthermore, as 7^2.2(^1 1) = 1, we obtain 

min(m,n— m) 

rA2.2{n,m) = 2+ ^ l'A2.2{n-m,x). (4) 

x=2 

We are assured that l<x<n — mhy the upper and lower bounds of the 
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summation in (4), and so we can apply the inductive hypothesis to get 

min(m,n— m) 

/^2.2 ('^) = 2 + {D*{n — m,,x) + D*{n — m — l,x)) 

min(m,n— m) min(m,n— m) 

= 2+ ^ D*(n-rn,a;) + ^ D*(n - m - 1, x). 

x=2 x=2 

By the definition of D* wc know that D*{n, 1) = 1, and so D*{n — m, 1) + 
-D*(n — m — 1,1) = 2. Replacing the leading 2 above with this expression, and 
inserting the terms D*{n — m, 1) and -D*(n — m — 1, 1) into the appropriate 
summations we find that 

min(m,n— m) min(m,n— m) 

-'^A2.2('^>'^) = -D*(n — m, a;)+ £'*(n — m — l,a;) . (5) 

x=l x=l 

By (2) wc know that the first term of (5) is equal to the first term of 2('^> — 
D*{n, m) + D*{n— 1, m), it therefore remains to show that 

min(m,n— m) 

£>*(n-l,m)= ^ £)*(n-m- l,x), 
or equivalently, that 

min(m,n— m— 1) min(m,n— m) 

D*{n — m — l,x) = — m — 1, a;). (6) 

Suppose m < n — m — I. Then, min(m, n — m — 1) = min(m, n — to), and 
so the left and right-hand sides of (6) are equal. Suppose, alternatively, that 
m > n—m—1. Hence, min(m, n— m— 1) = n— m— 1 and min(TO, n — to) = n—m 
and so we get 

n—m n—m—1 

£>*(n — TO — 1, x) = D* {n — m — l,x) + D* {n — m — l,n — m). 

x=l x=l 

Since n — m — l<n — mwe know that D*{n — m — l,n — m) = 0, and therefore 
(6) is verified. 

Therefore, by (5) and (6) we know that 7^2. 2("''™) — D*{n,m) + D*{n — 
1,to), as required. □ 

Lemma 2.1 is a crucial step in our analysis of Algorithm 2.2 as it relates the 

number of invocations required to generate a given set of descending composi- 
tions to the function D*(n,m). Much is known about the D*{n,m) numbers, 
as they count the partitions of n where the largest part is to; thus, we can 
then relate the number of invocations required to the partition numbers, p{n). 
Therefore, let us formally define /a2.2(^) to be number of invocations of Algo- 
rithm 2.2 required to generate all p{n) descending compositions of n. We then 
get the following result. 
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Theorem 2.2. If n > 1 then lA2.2{n) = p{n) +p{n — 1). 

Proof. Suppose n > 1. To generate all descending compositions of n we in- 
voke RECDESC(2n, n, 1) (see discussion above), and as n > 1 we can apply 
Lemma 2.1, to obtain 7^2 2(2'^''^) — D*{2n,n) + D*{2n — l,n), and thus 
lA2.2in) = D*{2n,n) + D*\2n - l,n). We know that D*{2n,n) = p{n), as 
we can clearly obtain a descending composition of n from a descending com- 
position of 2n where the first part is exactly n by removing that first part. 
Similarly. D*{2n — l,n) = p{n — 1), as we can remove the first part of size 
n from any descending composition of 2n — 1 with first part equal to n, ob- 
taining a descending composition of n — 1. Thus, the descending compositions 
counted by the fimctions D*{2n, n) = p{n) and D*{2n—1, n) = p{n— 1). Hence, 
lA2.2{n) = p{n) + p{n — 1), completing the proof. □ 

Note that in Theorem 2.2, and in many of the following analyses, we restrict 
our attention to values n > 1. This is to avoid unnecessary complication of 
the relevant formulas in accounting for the case where n ~ 1. In the above, if 
we compute lA2.2{n) = p{n) + p{n — 1) for n = 1, we arrive at the conclusion 
that the number of invocations required is 2, as p(0) = 1 by convention. In the 
interest of clarity we shall ignore such contingencies, as they do not affect the 
general conclusions we draw. 

Using Theorem 2.2 it is now straightforward to show that RecDesc gener- 
ates all descending compositions of n in constant amortised time. To show that 
the algorithm is constant amortised time we must demonstrate that the average 
number of invocations of the algorithm per object generated is bounded, from 
above, by some constant. To do this, let us formally define lA2.2{n) as the 
average number of invocations of RecDesc required to generate a descending 
composition of n. Clearly, as the total number of invocations is /a2.2(?^) and 
the number of objects generated is p{n), we have lA2.2{n) — lA2.2{n)lp{n). 

Since lA2.2{n) = piji) + p(n — 1) by Theorem 2.2, we have lA2.2{n) = 1 + 
p{n)/p{n — 1). It is well known that p{n) > p{n — 1) for all n > 1, and therefore 
p{n — l)/p{n) < 1. From this inequality we can then deduce that lA2.2{n) < 2, 
proving that Algorithm 2.2 is constant amortised time. It is useful to have 
a more precise asymptotic expression for the average number of invocations 
required to generate a descending composition using RecDesc, lA2.2{n). By 
the asymptotic estimate for p{n — t)/p(n) [27, p. 11] we then get lA2.2{n) = 
1 -|- e~'^/ ^ (l -|- 0(n~^/^)) , with C = 7r/\/6. Simplifying this expression we get 

/A2.2(n) = 1 + e-"/^ (l + 0(n-i/6)^ . (7) 

In this subsection we have described and provided a new analysis for the 
most efficient known recursive descending composition generation algorithm, 
which is due to Ruskey [44, §4.8]. Ruskey demonstrates that RecDesc is con- 
stant amortised time by reasoning about the number of children each node in 
the computation tree has, but does not derive the precise number of invocations 
involved. In this section we rigorously counted the number of invocations re- 
quired to generate all descending compositions of n using this algorithm, and 
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Table 1: A comparison of recursive partition generators. The ratio of the time 
required by our ascending composition generation algorithm and Ruskey's al- 
gorithm in the Java and C languages is shown. 

61 72 77 90 95 109~ 

p(n) = 1.12 xlO^ 5.39 xlO« 1.06x10'' 5.66 xlO'' 1.05x10* 5.42x10* 

Java 0.56 0.56 0.56 0.55 0.55 0.55 
C 0.40 0.48 0.49 0.50 0.50 0.50 
Theoretical 0.54 0.54 0.53 0.53 0.53 0.53 



related the recurrence involved to the partition numbers. We then used an 
asymptotic formula for p{n) to derive the number of invocations required to 
generate each partition, on average. In the next subsection we use this analysis 
to compare Ruskey's descending composition generator with our new ascending 
composition generator. 

2.3 Comparison 

Performing the comparison between the recursive algorithms to generate all 
ascending compositions and to generate all descending compositions of n is 
a simple procedure. RecAsc requires p{n) invocations to generate all p{n) 
partitions of n whereas RecDesc requires p(n) + p{n — 1) invocations. The 
asymptotics of p{n) show that, as n becomes large, p{n — l)/p{n) approaches 
1. Thus, we can reasonably expect the descending composition generator to 
require approximately twice as long as the ascending composition generator to 
generate all partitions of n. 

In Table 1 we see a comparison of the actual time spent in generating par- 
titions of n using Ruskey's algorithm. Algorithm 2.2, and our ascending com- 
position generator. Algorithm 2.1. In this table we report the time spent by 
Algorithm 2.1 in generating all ascending compositions of n, divided by the 
time required by Ruskey's algorithm (we report these ratios as the actual du- 
rations are of little interest). Several steps were taken in an effort to address 
Sedgewick's concerns about the empirical comparisons of algorithms [49]. Direct 
and literal implementation of the algorithms concerned were written in the C 
and Java languages and compiled in the simplest possible manner (i.e., without 
the use of compiler 'optimisations'). Execution times were measured as accu- 
rately as possible and the minimum value over five runs used. The C programs 
were compiled using GCC version 3.3.4 and the Java programs compiled and 
run on the Java 2 Standard Edition, version 1.4.2. All programs were executed 
on an Intel Pentium 4 processor running Linux kernel 2.6.8. See Kelleher [22, 
p. 111-114] for a full discussion of the methodology adopted in making these 
observations. 

The values of n are selected such that n is the smallest integer where p{n) > 
1 X lO'^ and p{n) > 5 x lO'" for 6 < x < 8. Orders of magnitude larger than these 
values proved to be infeasible on the experimental platform; similarly, the time 
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elapsed in generating fewer than a million partitions was too brief to measure 
accurately. Along with the observed ratios of the time required by RecAsc 
and RecDesc we also report the theoretically predicted ratio of the running 
times: p{n)/{p{n) +p{n — 1)). We can sec from Tabic 1 that these theoretically 
predicted ratios agree well with the empirical evidence. We can also see that as 
n becomes larger, Ruskey's algorithm is tending towards taking twice as long 
as RecAsc to generate the same partitions. 

3 Succession Rules 

In this section we consider algorithms of the form studied by Kemp in his gen- 
eral treatment of the problem of generating combinatorial objects [23] . Kemp 
reduced the problem of generating combinatorial objects to the generation of 
all words in a formal language £, and developed powerful general techniques 
to analyse such algorithms. Specifically, Kemp studied "direct generation algo- 
rithms" that obey a simple two step procedure: (1) scan the current word from 
right-to-left until we find the end of the common prefix shared by the current 
word and its immediate successor; and (2) attach the new suffix to the end of 
this shared prefix. The cost of this process can be easily quantified by counting 
the number of 'read' operations required in step (1), and the number of 'write' 
operations in step (2). To determine the complexity of generating a given lan- 
guage, we can count the number of these operations incurred in the process of 
generating all words in the language. 

The section proceeds as follows. In Section 3.1 we derive a new succession 
rule for ascending compositions. We then use this succession rule to develop a 
generation algorithm, which wc subsequently analyse. Then, in Section 3.2 we 
examine the well-know succession rule for generating descending compositions 
in reverse lexicographic order, and analyse the resulting algorithm. Following 
this. Section 3.3 compares the two algorithms in terms of Kemp's read and write 
operations. Finally, in Section 3.4 we develop a new formula for p{n) using our 
analysis the succession rule for ascending compositions. 

3.1 Ascending Compositions 

We are concerned here with developing a simple succession rule that will allow 
us to generate the lexicographic successor of a given ascending composition, and 
using this rule to develop a generation algorithm. To do this it is convenient to 
define the following notation. 

Definition 3.1 (Lexicographic Minimum). For some positive integers m < n, 
the function M^(n,m) computes the lexicographically least element of the set 
A{n, m). 

Definition 3.2 (Lexicographic Successor). For any ai...ak € A{n) \ (n) 
the function S^{ai...ak) computes the immediate lexicographic successor of 
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The succession rule for ascending compositions is then stated simply. We 
obtain the lexicographically least composition in A{n, m) by appending m to the 
lexicographically least composition m A{n — m,m). If 2m > n then there are no 
compositions in A{n,m.) with more than one part, leading us to conclude that 
there is only one possible composition; and this must be the lexicographically 
least. This leads us to the following recurrence: 

M_4(n, m) = m ■ M_4(n — m, m) (8) 

where M^(n, m) = (n) if 2m > n. See Kelleher [22, p. 84] for a proof of (8). We 
can also derive a nonrecursive succession rule for S^, which we develop in the 
following sequence of results. 

Lemma 3.1. For all positive integers m <n, the lexicographically least element 

of the set A{n, m) is given by 

M^(n, m) = m .rri{n — fJ,m), (9) 

where fj, = [n/m\ — 1. 

Proof. Proceed by strong induction on n. 

Bgise case: n = 1 Since 1 < m < n and n = 1, then m = 1, and so 2m > n. 
Then, by (8), we know that M^(n,TO) = (n). Thus, as /x = when n = 1, (9) 
correctly computes M^(n, m) when n = 1. 

Induction step Suppose, for some positive integer n that (9) holds true for 
all positive integers m' < n' < n. Suppose m is an arbitrary positive integer 
such that m < n. Suppose then that 2m > n. By dividing both sides of this 
inequality by m, we see that n/m < 2, and so [n/m\ < 1. Similarly, as m < n, 
it follows that 1 < n/m, and so 1 < [n/m\. Thus, 1 < [n/m\ < 1, and so 
[n/mj = 1; hence fi = 0. By (8) M_4(ri, m) = (n), and as /i = 0, zero copies of 
m are concatenated with (n — i^m), and so (9) correctly computes M^(n, m). 
Suppose then that 2m < n. By the inductive hypothesis and (8) we have 

M^(n, m) =m ■ rnC. . ni {n — m — n'm) 

Clearly, if /i = /i' + 1, then (9) correctly computes the lexicographically least 
element of A{n,m). We know that /u' = [{n — m)/m\ — 1, which gives us 
= \n/m — IJ — 1. It follows that jj! = [n/m\ — 2, and, as = [n/mj — 1 
from (9), we have /i = /x' + 1, completing the proof. □ 

Theorem 3.1 (Lexicographic Successor). If ai . . .ak € A{n) \ {(n)} then 

S^(ai . . . ttfc) = ai . . . afc-2 m. . .m{n' — /im) (10) 
where m = ak-i + 1, n' = Uk-i + ak, and fi = [n'/mj — 1. 
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Algorithm 3.1 RuLEAsc(n) 
Require: n > 

1: fc^2 

2: ai <— 

3: a2 n 

4: while k 1 do 

5: J/ ^ flfe - 1 
6: /C <— fc - 1 
7: a; <— Ofe + 1 

8: while X < y do 

9: Cfe X 

10: y ^ y — X 

11: A: ^ fc + 1 

12: end while 

13: ak ^ x + y 

14: visit ai . . . Ofe 

15: end while 



Proof. Suppose n is an arbitrary positive integer. Let ai . . . ak be an arbi- 
trary element of A{n) \ {{n)}. Clearly, there is no positive integer x such that 
oi . . . Ofe_i(ofc + x) e A{n). The initial part of M^{ak + ak-i,ak-i + 1) is 
the least possible value we can assign to Uk-i; and the remaining parts (if any) 
are the lexicographically least way to extend ai . . . ak-i to a complete ascending 
composition of n. Therefore, S^(ai . . . a^) = ai . . . ak-2^A{(^k-i+(ik, dfe-i + l)- 
Then, using Lemma 3.1 we get (10) as required. □ 

The succession rule (10) is implemented in RuleAsc (Algorithm 3.1). Each 
iteration of the main loop visits exactly one composition, and the internal loop 
generates any sequences of parts required to find the lexicographic successor. 
We concentrate here on analysis of this algorithm; see Kelleher [22, §5.3.1] for 
a full discussion and proof of correctness. The goal of our analysis is to derive 
a simple expression, in terms of the number of partitions of n, for the total 
number of read and write operations [23] made in the process of generating all 
ascending compositions of n. We do this by first determining the frequency of 
certain key instructions and using this information to determine the number of 
read and write operations involved. 

Lemma 3.2. The number of times line 6 is executed during the execution of 
Algorithm 3.1 is given by tQ{n) =p{n). 

Proof. As Algorithm 3.1 correctly visits all p{n) ascending compositions of n, 
we know that line 14 is executed exactly p{n) times. Clearly line 6 is executed 
precisely the same number of times as line 14, and so we have t^ (n) = p{n) , as 
required. □ 

Lemma 3.3. The number of times line 11 is executed during the execution of 
Algorithm 3.1 is given by tii(n) = p{n) — 1. 
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Proof. The variable k is used to control termination of the algorithm. From 
line 1 we know that k is initially 2, and from line 4 we know that the algorithm 
terminates when k = 1. Furthermore, the value of k is modified only on lines 6 
and 11. By Lemma 3.2 we know that k is decremented p{n) times; it then 
follows immediately that k must be incremented p{n) — 1 times, and so we have 
tii{n) = p{n) — 1, as vequiied. □ 

Theorem 3.2. Algorithm 3.1 requires RA3.i{n) = 2p{n) read operations to 
generate the set A{n) . 

Proof. Read operations are carried out on lines 7 and 5, which are executed 
p{n) times each by Lemma 3.2. Thus, the total number of read operations is 

RA3.i{n) = 2p{n). □ 

Theorem 3.3. Algorithm 3.1 requires WA3.i{n) = 2p(n) — 1 write operations 
to generate the set A{n), excluding initialisation. 

Proof. After initialisation, write operations are carried out in Algorithm 3.1 only 
on lines 9 and 13. Line 13 is executed p{n) times by Lemma 3.2. We can also 
see that line 9 is executed exactly as many times as line 11, and by Lemma 3.3 
we know that this value is p{n) — 1. Therefore, summing these contributions, 
we get WA3.i{n) = 2p{n) — 1, completing the proof. □ 

From Theorem 3.3 and Theorem 3.2 it is easy to see that we require an aver- 
age of two read and two write operations per partition generated, as we required 
2p{n) of both operations to generate all p{n) partitions of n. Thus, for any value 
of n we are assured that the total time required to generate all partitions of n 
will be proportional to the number of partitions generated, implying that the 
algorithm is constant amortised time. 



3.2 Descending Compositions 

Up to this point we have considered only algorithms that generate compositions 
in lexicographic order. The majority of descending composition generation algo- 
rithms, however, visit compositions in reverse lexicographic order (McKay [34] 
refers to it as the 'natural order' for partitions). There arc many different pre- 
sentations of the succession rule required to transform a descending composition 
from this list into its immediate successor: see Andrews [4, p. 230], Knuth [27, 
p.l], Nijenhuis & Wilf [36, p.65-68], Page & Wilson [38, §5.5], Skiena [50, p.52], 
Stanton & White [53, p. 13], Wells [60, p. 150] or Zoghbi & Stojmenovic [62]. No 
analysis of this succession rule in terms of the number of read and write opera- 
tions [23] involved has been published, however, and in this section wc analyse 
a basic implementation of the rule (we study more sophisticated techniques in 
Section 4.2). 

If we formally define Sx)(rfi . • • rffe) to be the immediate lexicographic prede- 
cessor of a di . . . d/c G 'D{n) \1 . . .1, the succession rule can be formulated as 
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Algorithm 3.2 RuLEDESC(n) 
Require: n > 

1: rfi <— n 
2: k^l 

3; visit di 

4: while k do 

5: i^k 
6: m <— dq 

7: while m = 1 do 

8; fc <— A; - 1 
9: m-i— dq 

10: end while 

11: n' <— TO + ^ - fc 
12: TO <— TO — 1 

13: while TO < n' do 

14; dfc ^ m 

15: n' ^ n' — m 

16: fc <— fc + 1 

17: end while 
18: dk ^ n' 
19: visit d\. . .dk 
20: end while 



follows. Given a descending composition di . . .dk where di 7^ 1, we obtain the 
next composition in the ordering by applying the transformation 



where q is the rightmost non-1 value (i.e., dj > 1 for 1 < j < g and dj = 1 for 
q < j < k) , m = dq — 1, n' = dq + k — q and fj, = [n'/m\ — [n' mod to = 0]. This 
presentation can readily be derived from the treatments cited in the previous 
paragraph. 

The succession rule (11) is implemented in RuleDesc (Algorithm 3.2), 
where each iteration of the main loop implements a single application of the 
rule. The internal loop of lines 7-9 implements a right-to-left scan for the 
largest index q such that rf^ > 1, and the loop of lines 13-17 inserts /x copies of 
TO into the array. We analyse the algorithm by first determining the frequency 
of certain key statements, and using this information to derive the number of 
read and write operations needed to generate all descending compositions of n. 

Lemma 3.4. The number of times line 8 is executed during the execution of 
Algorithm 3.2 is given hy ti{n) = 1 — n + P{x)- 

Proof. As exactly one descending composition is visited per iteration of the outer 
while loop, we know that upon reaching line 6 there is a complete descending 



Sx)(di . . . dk) = d\. . . dq-i m . . .m{n' — iim) 




(11) 
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composition of n contained in di...dk. Furthermore, as rfi > ••• > rffe, we 
know that all parts of size 1 are at the end of the composition, and so it is 
clear that line 7 will be executed exactly once for each part of size 1 in any 
given composition. As wc visit the compositions at the end of the loop and we 
terminate when k = n we will not reach line 5 when the composition in question 
consists of n copies of 1 (as this is the lexicographically least, and hence the 
last descending composition in reverse lexicographic order). Thus, line 7 will be 
executed exactly as many times as there are parts of size 1 in all partitions of n, 
minus the n Is contained in the last composition. It is well known [21, p. 8] that 
the number of Is in all partitions of n is 1 + • • • -\-p{n — 1), and therefore 

we see that line 7 is executed exactly 1 — n + P{^)^ required. □ 

Lemma 3.5. The number of times line 16 is executed during the execution of 
Algorithm 3.2 is given by tie{n) — ^2=iP(-'^)- 

Proof. The variable k is used to control termination of Algorithm 3.2: the algo- 
rithm begins with k = 1 and terminates when k = n. Examining Algorithm 3.2 
we see that k is modified on only two lines: it is incremented on line 16 and 
decremented on line 8. Thus, we must have n — 1 more increment operations 
than decrements; by Lemma 3.4 there are exactly 1 — n + p{x) decre- 

ment operations, and so we see that line 14 is executed X^"ZiP(a;) times, as 
required. □ 

Theorem 3.4. Algorithm 3.2 requires RA3.2{n) = J22=iPi^) ~ ^ '''^^'^ opera- 
tions to generate the set T>{n). 

Proof. Read operations are performed on lines 6 and 9 of Algorithm 3.2. By 
Lemma 3.4 we know that line 8 is executed 1 — n 4- X]"=i Pi^) times, and so 
line 9 is executed an equal number of times. Clearly line 6 is executed p{n) — 1 
times, and so wc get a total of -Ra3.2(^) = X]"=iP(^) ~ required. □ 

Theorem 3.5. Algorithm 3.2 requires WA3.2{n) = Y^2=iPi^) ~ ^ write opera- 
tions to generate the set ©(n), excluding initialisation. 

Proof. The only occasions in Algorithm 3.2 where a value is written to the 
array d, are lines 14 and 18. By Lemma 3.5 we know that line 16 is executed 
exactly J22=i P(^) times, and it is straightforward to see that line 14 is executed 
precisely the same number of times. As we visit exactly one composition per 
iteration of the outer while loop, and all descending compositions except the 
composition (n) are visited with this loop, we then see that line 18 is executed 
p{n) — 1 times in all. Therefore, summing these contributions we get WA3.2{n) = 
Pi^) + Pi''^) ~ 1 = S"=i P(^) - 1 as required. □ 

Theorems 3.4 and 3.5 derive the precise number of read and write operations 
required to generate all descending compositions of n using Algorithm 3.2, and 
this completes our analysis of the algorithm. We discuss the implications of 
these results in the next subsection, where we compare the total number of read 
and write operations required by RuLEAsc(n) and RuLEDESC(n). 
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3.3 Compcirison 

In this section wc developed two algorithms. The first algorithm we considered, 
RuleAsc (Algorithm 3.1), generates ascending compositions of n; the second 
algorithm, RuleDesc (Algorithm 3.2), generates descending compositions of n. 
We analysed the total number of read and write operations required by these 
algorithms to generate all partitions of n by iteratively applying the succes- 
sion rule involved. The totals obtained, disregarding unimportant (i.e. 0(1) or 
0{n)) trailing terms, for the ascending composition generator are summarised 
as follows. 

RAs.i{n)^2p{n) and W^A3.i(n) ~ 2p(n) (12) 

That is, we require approximately 2p(n) operations of the form x aj and 
approximately 2p{n) operations of the form aj <~- x to generate all partitions 
of n using the ascending composition generator. Turning then to the descend- 
ing composition generator, we obtained the following totals, again removing 
insignificant trailing terms. 

n n 

i?A3.2(n)«^p(a;) and W^A3.2(n) « ^p(a:) (13) 

x=l x=l 

These totals would appear to indicate a large disparity between the algorithms, 
but we must examine the asymptotics of ^'(^) determine whether this 

is significant. We shall do this in terms of the average number of read and write 
operations per partition which is implied by these totals. 

We know the total number of read and write operations required to generate 
all p{n) partitions of n using both algorithms. Thus, to determine the expected 
number of read and write operations required to transform the average parti- 
tion into its immediate successor we must divide these totals by p{n). In the 
case of the ascending composition generation algorithms this is trivial, as both 
expressions are of the form 2p{n), and so dividing by p{n) plainly yields the 
value 2. Determining the average number of read and write operations using 
the succession rule for descending compositions is more difficult, however, as 
both expressions involve a factor of the form X]"=i-P(^)- 

Using the asymptotic expressions for p{n) we can get a qualitative estimate 
of these functions. Odlyzko [37, p. 1083] derived an estimate for the value of 
sums of partition numbers which can be stated as follows 



y^pix) = = (l + 0(n-i/^) 



2W2n 

Then, dividing this by the asymptotic expression for p{n) we get the following 
approximation 

-l-^±pix)^l + ^, (14) 

which, although crude, is sufficient for our purposes. The key feature of (14) is 
that the value is not constant: it is 0{^/n). Using this approximation we obtain 
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the following values for the number of read and write operations expected to 
transform a random partition of n into its successor. 

Reads Writes 

Ascending 2 2 

Descending 1 + 0.78v^ 1 + 0.78v^ 

We can see the qualitative difference between the algorithms by examining 
their read and write tapes in Figure 1. The tapes in question are generated 
by imagining that read and write heads mark a tape each time one of these 
operations is made. The horizontal position of each head is determined by the 
index of the array element involved. The tape is advanced one xmii each time a 
composition is visited, and so we can see the number of read and write operations 
required for each individual partition generated. Regarding Figure 1 then, and 
examining the read tape for RuleAsc, we can see that every partition requires 
exactly 2 reads; in contrast, the read tape for RuleDesc shows a maximum of 
n — 1 read operations per partition, and this oscillates rapidly as we move along 
the tape. Similarly, the write tape for RuleAsc shows that we sometimes need 
to make a long sequence of write operations to make the transition in question, 
but that these are compensated for — as our analysis has shown — by the 
occasions where we need only one write. The behaviour of the write head in 
RuleDesc is very similar to that of its read head, and we again see many 
transitions where a large number of writes are required. 

The difference between RuleAsc and RuleDesc is not due to some al- 
gorithmic nuance; rather, it reflects of a structural property of the objects in 
question. The total suffix length [23] of descending compositions is much greater 
than that of ascending compositions, because in many descending composition 
the suffix consists of the sequence of Is; and we known that the total number 
of Is in all partitions of n is P{'^)- well-defined way, it is more effi- 

cient to generate all ascending compositions than it is to generate all descending 
compositions. 

3.4 A new formula for p{n) 

Although not strictly relevant to our analyses of ascending and descending com- 
position generation algorithms, another result follows directly from the analysis 
of Algorithm 3.1. If we compare the lexicographic succession rule (10) and Al- 
gorithm 3.1 carefully, we realise that the /U copies of m must be inserted into 
the array within the inner loop of lines 8-12; and our analysis has given us the 
precise number of times that this happens. Therefore, we know that the sum 
of /U values over all ascending compositions of n (except the last composition, 
(n)), must equal the number of write operations made in the inner loop. Using 
this observation we then get the following theorem. 
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Read {154) 
RuleAsc(12) 



Write (154) 



p{12) = 77 



{271 ) Read 



{271 ) Write 
RuleDesc(12) 



Figure 1: Read and write tapes for the direct implementations of succession 
rules to generate ascending and descending compositions. On the left wc have 
the read and write tapes for the ascending composition generator, Algorithm 3.1; 
on the right, then, are the corresponding tapes for the descending composition 
generator. Algorithm 3.2. In both cases, the traces correspond to the read and 
write operations carried out in generating all partitions of 12. 
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Theorem 3.6. For alln> 1 



p{n) = 



( 



l+n+ 



\ 



A{n)\{(n)} 



ak-i + 1 



(15) 



Proof. We know from Lemma 3.3 that the total number of write operations 

made by Algorithm 3.1 in the inner loop of lines 8 12 is given by p{n) — 1. 
Algorithm 3.1 applies the lexicographic succession rule above to all elements 
of A{n) \ {('^)}, as well as one extra composition, which we refer to as the 
'initialisation composition'. The initialisation composition is not in the set A{n) 
as Oi = 0, and so we must discount the number of writes incurred by applying 
the succession rule to this composition. The composition visited immediately 
after On is 1 ... 1, and so n — 1 copies of 1 must have been inserted into the array 
in the inner loop during this transition. Therefore, the total number of writes 
made within the inner loop in applying the succession rule to all elements of 
A(n) \ {{n)} is given by p{n) — 1 — (n — 1) = p{n) — n. Therefore, from this 
result and the succession rule of Theorem 3.1 we get 



pin) -n= ^ { 



A{n)\{(n)} 



ttk-i + ak 
flfe-i + 1 



1 



from which it is easy to derive (15), completing the proof. 



□ 



We can simplify (15) if we suppose that all a\. . .ak G A{n) are prefixed by 
a value 0. More formally, a direct consequence of Theorem 3.6 is that 



P{n) = \\l 



E 



gfc-l + ak 
flfe-l + 1 



(16) 



where A! {n) = {Q ■ ai . . . ak \ ai . . . ak G A{n)}. Fundamentally, what Theo- 
rem 3.6 shows us is that if we let y be the largest part and x the second largest 
part in an arbitrary partition of n, we can count the partitions of n by summing 
\_{x + y)/{x + 1)J over all partitions of n. 

The partition function p{n) is one of the most important functions in the 
theory of partitions and has been studied for several centuries [5]. The asymp- 
totic [20] and arithmetic [2] properties of p{n) have been very thoroughly ex- 
amined. While (16) is clearly not an efficient means of computing p{n), it may 
provide some new insight into this celebrated function. 



4 Accelerated Algorithms 

In this section we examine algorithms that use structural properties of the sets of 
ascending and descending compositions to reduce the number of read and write 
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operations required. The algorithms presented are the most efficient known 
examples of ascending and descending composition generators, ensuring that 
we have a fair comparison of the algorithms arising from the two candidate 
encodings for partitions. In Section 4.1 we develop a new ascending compo- 
sition generator that requires fewer read operations than RuleAsc. Then, in 
Section 4.2 we study the most efficient known descending composition genera- 
tion algorithm, due to Zoghbi & Stojmenovic [62], which requires far fewer read 
and write operations than RuleDesc. In Section 4.3, we compare these two 
algorithms to determine which of the two is more efficient. 

4.1 Ascending Compositions 

In this subsection we improve on RuleAsc (Algorithm 3.1) by applying the the- 
ory of 'terminal' and 'nonterminal' compositions. To enable us to fully analyse 
the resulting algorithm we require an expression to enumerate terminal ascend- 
ing compositions in terms of p{n). In the opening part of this subsection we 
develop the theory of terminal and nonterminal compositions. A byproduct of 
this analysis is a new proof for a partition identity on the number of partitions 
where the largest part is less than twice the second largest part. After develop- 
ing this necessary theory, we move on to the description of the algorithm itself, 
and its subsequent analysis. 

4.1.1 Terminal and Nonterminal Compositions 

The algorithm that we shall examine shortly uses some structure within the 
set of ascending compositions to make many transitions very efficient. This 
structure is based on the ideas of 'terminal' and 'nonterminal' compositions. 
We now define these concepts and derive some basic enumerative results to aid 
us in our analysis. 

Definition 4.1 (Terminal Ascending Composition). For some positive integer 

n, an ascending composition ai . . . G A{n) is terminal ifk = 1 or2ak~i < cik- 
Let %i(n, m) denote the set of terminal compositions in A{n, m), and T_^{n, m) 
denote the cardinality of this set (i.e. T^{n,m) = |T4(n, m)|j. 

Definition 4.2 (Nonterminal Ascending Composition). For some positive in- 
teger n, fli . . . flfe G A{n) is nonterminal if k > 1 and 2ak-i > ak- Let Afj_{n, m) 
denote the set of nonterminal compositions in A{n,m), and let Nj^(n,m) denote 
the cardinality of this set (i.e. N^{n,m) = \N'^{n,m)\). 

If we let A{n, m) denote the number of ascending compositions of n where 
the initial part is at least m it can be shown [22, ch.3] that 



holds for all positive integers m < n. We require a similar recurrence to enu- 
merate the terminal ascending compositions, and so we let T^{n,m) denote the 



L«/2J 




(17) 



21 



number of terminal compositions in the set A{n,m). The terminal ascending 
compositions are a subset of the ascending compositions, and the construction 
rule impHed is the same: the number of terminal ascending compositions of n 
where the initial part is exactly m is equal to the number of terminal composi- 
tions of n — TO with initial part at least to. The only difference, then, between 
the recurrences for ascending compositions and terminal ascending compositions 
occurs in the boundary conditions. The recurrence can be stated as follows: for 
all positive integers m < n, T_^{n,m) satisfies 

[n/3\ 

T4(n,m) = l+ J2 TA{n-x,x). (18) 

x—m 

See Kelleher [22, p. 160-161] for the proofs of recurrences (17) and (18). 

Before we move onto the main result, where we prove that Ty^{n,m) = 
A{n,m.) — A{n — 2, to), we require some auxiliary results which simplify the 
proof of this assertion. In Lemma 4.1 we prove an equivalence between logical 
statements of a particular form involving the floor function, which is useful in 
Lemma 4.2; the latter lemma then provides the main inductive step in our proof 
of the central theorem of this section. In the interest of brevity, we limit our 
proofs to values of n > 3, since n < 3 can be easily demonstrated and would 
unnecessarily complicate the proofs. 

Lemma 4.1. If x, m and n are positive integers then x < [{n — x)/m\ 

X < [n/{m + 1)J . 

Proof. Suppose x, m and n are positive integers. Suppose x < [(n — x)/m\. 
Thus, X < {n — x)/m, and so x < n/(m + l). Then, as [n/(m + l)J < n/(TO-|-l) 
and X is an integer, we know that x < [n/ (m + 1)J , and so x < [(n — x)/m\ => 
X < [n/ (to, + 1)J . 

Suppose that x < [n/(TO-|- 1)J . Then, x < n/{m+l), and so x < {n~ x)/m. 
Once again, as x is an integer it is apparent that x < [{n — x)/m\ < {n — x)/m, 
and so a; < [n/{m + 1)J x < [{n — x)/m\. Therefore, as x < [(n — 

x)/m\ =4> X < [n/(TO + 1)J and X < [n/(m + 1)J => x < [{n — x)/m\ we 
see that x < [(n — x)/mj <s=^ x < [n/{m + 1)J, as required. □ 

Lemma 4.2. For all positive integers n > 3 

L»/2J L("-2)/2j 

^ A{n-x,x) = l+ ^ A{n-2-x,x). (19) 

x=[n/3J + l a;=[n/3J+l 

Proof. Suppose n > 3 and 1 < m < n, and consider the left-hand side of (19). 
We know that A{n,m) = 1 if to > [n/2j, as the summation in recurrence 
(17) will be empty. By the contrapositive of Lemma 4.1 we know that x > 
[(n — x)/2\ X > [?T./3J, and we therefore know that each term in the 

summation of the left-hand side of (19) is equal to 1. Thus, we see that 

L«/2J 

A{n - X, x) = [n/2j - [n/3j - 1. (20) 

x=[n/3J-|-l 



22 



Similarly, as x > [n/3j =^ x > [(n — x)/2\, it clearly follows that 
x>ln/3\ ^ a; > [(n- a;)/2j - 1, or a; > [n/3j =^ x > [{n - 2 - x)/2\. 
Thus, each term in the summation on the right-hand side of (19) must also 
equal 1, and so we get 

L(«-2)/2j 

1+ J2 A{n-2-x,x) = l+[{n-2)/2\-[n/3\-l 

x=[n/3J+l 

= [n/2\ - [n/3\ - 1. (21) 

Therefore, as (20) and (21) show that the left-hand and right-hand side of (19) 
are equal, the proof is complete. □ 

Theorem 4.1. If n > 3, then Tj,{n,m) = A{n,m) — A{n — 2,m) for all 1 < 

m < ln/2\. 

Proof. Proceed by strong induction on n. 

Base case: n = 3 As 1 < m < [n/2j and n = 3, we know that m = 1. 
Computing r4(3, 1), we get 1 + T_a{2, 1) = 2. We also find ^(3, 1) = 3 and 
^(1, 1) = 1, and so the base case of the induction holds. 

Induction step Suppose Tj\(n' , m) = A{n\ m) — A{n' — 2, m) when \ <m < 
[n'/2j, for all 3 < n' < n, and some integer n. Then, as x < [(n — x)/2\ <J=^ 
x < [n/3j , by Lemma 4.1, we can apply this inductive hypothesis to each term 
TA{n — ^) in (18), giving us 

L"/3J 

T4(n, to) = 1 + i^i'iT' — x,x) — A{n — 2 — x, x)) 

x=m 

Ln/3J L"/3J 

= 1 + ^ A{n -x,x)- ^ A{n -2-x,x). (22) 

x=m x=Tn 

By Lemma 4.2 we know that 

L"/2J L(«-2)/2J 

A{n-x,x)- ^ A{n-2-x,x)-l = 0, 

x=[n/3]+l a:=Lri/3j+l 

and so we can add the left-hand side of this equation to the right-hand side of 
(22), to get 

L"/3J L"/3J 

T4(n, to) = 1 + A{n -x,x) - ^ A{n -2-x,x) 

x=m x=m 

L"/2J L("-2)/21 

+ A{n-x,x)- A{n-2-x,x)-l. 

a;=[n/3J+l x=[n/3j+l 
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Then, gathering the terms A{n — x, x) and A{n — 2 — x,x) into the appropriate 
summations we get 

L"/2J L(»-2)/2j 

T^{n,m) = 1+ ^ A{n- x,x) -1- ^ A{n-2-x,x), 

x=m x=m 

which by (18) gives us Tj({n, m) = A{n, m) — A{n — 2, m), as required. □ 

For the purposes of our analysis it is useful to know the total number of 

terminal and nonterminal compositions of n, and it is worthwhile formalis- 
ing the results here for reference. Therefore, letting T_4^{n) = T_^{n, 1) and 
^A{n) = Nj({n, 1), we get the following corollaries defined in terms of the par- 
tition function p{n). 

Corollary 4.1. For all positive integers n, Tj({n) = p{n) — p{n — 2). 

Proof. As Tjx{n) = Tjx{n, 1) and A{n, 1) — p{n), proof is immediate by Theo- 
rem 4.1 for all n > 3. Since p{n) = for all n < and p{0) = 1, we can readily 
verify that T4(2) = r4(l) = 1, as required. □ 

Corollary 4.2. If n is a positive integer then N_A{n) = p(n — 2). 

Proof. An ascending composition is either terminal or nonterminal. As the 
total number of ascending compositions of n is given by p{n), we get N^{n) = 
p{n) — {p{n) — p{n — 2)) = p{n — 2), as required. □ 

Corollaries 4.1 and 4.2 prove a nontrivial structural property of the set of 
all ascending compositions, and can be phrased in more conventional partition 
theoretic language. Consider an arbitrary partition of n, and let y be the largest 
part in this partition. We then let x be the second largest part {x <y). Corol- 
lary 4.2 then shows that the number of partitions of n where 2a; > y is equal to 
the number of partitions of n— 2. This result is known, and has been reported by 
Adams- Watters [51, Seq.A027336]. The preceding treatment, however, would 
appear to be the first published proof of the identity. 

4.1.2 Algorithm 

Having derived some theoretical results about the terminal and nonterminal 

ascending compositions of n, we arc now in a position to exploit those prop- 
erties in a generation algorithm. In the direct implementation of the lexico- 
graphic succession rule for ascending compositions, RuleAsc, we generate the 
successor of ai . . . by computing the lexicographically least element of the 
set A{ak-i + afc,afc_i -I- 1), and visit the resulting composition. The algo- 
rithm operates by implementing exactly one transition per iteration of the main 
loop. The accelerated algorithm, AccelAsc, developed here operates on a 
slightly different principle: we compute the lexicographically least composition 
of A{ak-i+ak, cifc-i + 1), as before, but we now keep a watchful eye to see if the 
resulting composition is nonterminal. If it is, we can compute the lexicographic 
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Algorithm 4.1 AccELAsc(n) 
Require: n > 1 

1: fc^2 
2: ai 

3: y -i— n — 1 

4: while ^ 1 do 

5: fc <— A; - 1 

6: a; •*— afe + 1 

7: while 2a: < y do 

8: Ofc <— X 

10: A; <— fc + 1 

11: end while 

12: £^k+l 

13: while X < y do 

14: afe <— a; 
15: y 
16: visit ai . . . 

17: :r ^ x + 1 
18: y^y-1 

19: end while 

20: ly ^ jy + a; — 1 

21: Qk^y+l 
22: visit ai . . . ttfe 

23: end while 



successor simply by incrementing ak-i and decrementing afe. Otherwise, we 
revert to the standard means of computing the lexicographic successor. By 
analysing this algorithm, we shall see that this approach provides significant 
gains. We concentrate on the analysis of Algorithm 4.1 here — see Kelleher [22, 
§4.4.2] for further discussion and proof of correctness. 

Lemma 4.3. The number of times line 16 is executed during the execution of 
Algorithm 4-1 is given by tiein) = p{n — 2). 

Proof. Compositions visited on line 16 must be nonterminal because upon reach- 
ing line 12, the condition 2x > y must hold. As x and y arc the second-last and 
last parts, respectively, of the composition visited on line 16, then this compo- 
sition must be nonterminal by definition. Subsequent operations on x and y 
within this loop do not alter the property that 2a; > y, and so all compositions 
visited on line 16 must be nonterminal. 

Furthermore, we also know that all compositions visited on line 22 must 
be terminal. To demonstrate this fact, we note that if ai . . . is the last 
composition visited before we arrive at line 20, the composition visited on line 22 
must be ai . . . ak-2{ak-i + «fe)- Therefore, to demonstrate that this composition 
is terminal, we must show that 2ak-2 < ctfe-i + ctfe. We know that ak-2 < 
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Ofe-i < ife- It follows that 2afe_2 < 2afc_i, and also that 2afe_i < a^-i + a^. 
Combining these two inequalities, we see that 2ak-2 < 2afe_i < a^-i + a^, and 
so 2ak-2 < afc_i+afe. Thus all compositions visited on line 22 must be terminal. 

Then, as Algorithm 4.1 correctly visits all p{n) ascending compositions of 
n [22, p. 105], since all compositions visited on line 22 are terminal and as all 
compositions visited on line 16 are nonterminal, we know that all nonterminal 
compositions of n must be visited on line 16. By Corollary 4.2 there arc p{n — 2) 
nonterminal compositions of n, and hence tig = p{n — 2), as required. □ 

Lemma 4.4. The number of times line 5 is executed during the execution of 
Algorithm 4-1 is given by t^{n) = p{n) — p{n — 2). 

Proof. By Lemma 4.3 wc know that the visit statement on line 16 is executed 
p{n — 2) times. As Algorithm 4.1 correctly visits allp(n) ascending compositions 
of n, then the remaining — — 2) compositions must be visited on line 22. 
Clearly then, hne 22 (and hence line 5) is executed p{n) — p{n — 2) times. 
Therefore, t^ = p{n) — p{n — 2), as required. □ 

Lemma 4.5. The number of times line 10 is executed during the execution of 
Algorithm 4-1 is given by tio{n) = p{n) — p{n — 2) — 1. 

Proof. The variable A; is assigned the value 2 upon initialisation, and the algo- 
rithm terminates when k = 1. As the variable is only updated via increment 
(line 10) and decrement (line 5) operations, we know that there must be one 
more decrement operation than increments. By Lemma 4.4 we know that there 
arep(n)— p(n— 2) decrements, and so there must he p{n)—p{n—2) — l increments 
on the variable. Therefore, tio = p{n) — p{n — 2) — 1. □ 

Theorem 4.2. Algorithm 4-1 requires RA4.i{n) = p{n)—p{n—2) read operations 
to generate the set A{n) . 

Proof. Only one read operation occurs Algorithm 4.1, and this is done on line 6. 

By Lemma 4.4 wc know that line 5 is executed p{n) — p{n — 2) times, and it 
immediately follows that line 6 is executed the same number of times. Therefore, 
RA4.i{n) = p{n) — p{n — 2), as required. □ 

Theorem 4.3. Algorithm 4-1 requires WA4.i{n) = 2p{n) — 1 write operations 
to generate the set A{n), excluding initialisation. 

Proof. Write operations are performed on lines 8, 14, 15 and 21. Lemma 4.4 
shows that line 21 is executed p{n) -'p{n — 2) times. From Lemma 4.5 we know 
that line 8 is executed p{n) — p{n — 2) — 1 times. Then, by Lemma 4.3 we 
know that lines 14 and 15 are executed p{n — 2) times each. Summing these 
contributions we get WA4.i{n) = p{n)—p{n—2)+p{n)—p{n—2)~l + 2p{n—2) = 
2p{n) — 1, as required. □ 

Theorems 4.2 and 4.3 derive the precise number of read and write operations 
required to generate all partitions of n using Algorithm 4.1. This algorithm is a 
considerable improvement over our basic implementation of the succession rule. 
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Algorithm 3.1, in two ways. Firstly, by keeping p{n — 2) of the visit operations 
within the loop of lines 13-19, we significantly reduce the average cost of a write 
operation. Thus, although we do not appreciably reduce the total number of 
write operations involved, wc ensure that 2p(n — 2) of those writes arc executed 
at the cost of an increment and decrement on a local variable and the cost of a 
< comparison of two local variables — in short, very cheaply. 

The second improvement is that we dramatically reduce the total number of 
read operations involved. Recall that RuleAsc required 2p{n) read operations 
to generate all ascending compositions of n; Theorem 4.2 shows that AccelAsc 
requires only p{n) — p{n — 2) read operations. Wc also reduced the number of 
read operations by a factor of 2 by maintaining the value of y between iterations 
of the main while loop, but this could equally be applied to RuleAsc, and is 
only a minor improvement at any rate. The real gain here is obtained from 
exploiting the block-based nature of the set of ascending compositions, as we do 
not need to perform any read operations once we have begun iterating through 
the nonterminal compositions within a block. 

4.2 Descending Compositions 

In Section 3.2 we derived a direct implementation of the succession rule for 
descending compositions. We then analysed the cost of using this direct im- 
plementation to generate all descending compositions of n, and fomid that it 
implied an average of 0{^/n) read and write operations per partition. There 
are, however, several constant amortised time algorithms to generate descending 
compositions, and in this section we study the most efficient example. 

There is one basic problem with the direct implementation of the succession 
rule for descending compositions (RuleDesc): most of the read and write 
operations it makes are redundant. To begin with, the read operations incurred 
by RuleDesc in scanning the current composition to find the rightmost non-1 
value are unnecessary. As McKay [34] noted, we can easily keep track of the 
index of the largest non-1 value between iterations, and thereby eliminate the 
right-to-left scan altogether. The means by which we can avoid the majority 
of the write operations is a little more subtle, and was first noted by Zoghbi & 
Stojmenovic [62]. For instance, consider the transition 

3321111 ^ 33111111. (23) 

RuleDesc implements the transition from 3321111 to 33111111 by finding the 

prefix 33 and writing six copies of 1 after it, oblivious to the fact that 4 of the 
array indices already contain 1. Thus, a more reasonable approach is to make a 
special case in the succession rule so that if rfg = 2, we simply set dq 1 and 
append 1 to the end of the composition. This observation proves to be sufficient 
to remove the worst excesses of RuleDesc, as Is are by far the most numerous 
part in the partitions of n. 

Zoghbi & Stojmcnovic's algorithm implements both of these ideas, and 
makes one further innovation to reduce the number of write operations required. 
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By initialising the array to hold n copies of 1, we know that any index > k must 
contain the value 1, and so we can save another write operation in the special 
case oi dq = 2 outlined above. Thus, Zoghbi & Stojmenovic's algorithm is the 
most efficient example, and conscqiiently it is the algorithm that we shall use 
for our comparative analysis. Knuth developed a similar algorithm [27, p. 2]: he 
also noted the necessity of keeping track of the value of q between iterations, 
and also implemented the special case for dg outlined above. Knuth's algo- 
rithm, however, does not contain the further improvement included by Zoghbi 
& Stojmenovic (i.e. initialising the array to 1 ... 1 and avoiding the second write 
operation in the dq = 2 special case), and therefore requires strictly more write 
operations than Zoghbi & Stojmenovic's. Zoghbi & Stojmenovic's algorithm 
also consistently outperforms Knuth's algorithm in empirical tests. 

Zoghbi & Stojmenovic's algorithm is presented in Algorithm 4.2, which we 
shall also refer to as AccelDesc. Each iteration of the main loop implements 
a single transition, and two cases are identified for performing the transition. In 
the conditional block of lines 8-10 we implement the special case for dq = 2: we 
can see that the length of the composition is incremented, dq is assigned to 1 and 
the value of q is updated to point to the new rightmost non-1 part. The general 
case is dealt with in the block of lines 11-29; the approach is much the same 
as that of RuleDesc, except in this case we have the additional complexity of 
maintaining the value of q between iterations. 

Lemma 4.6. The number of times line 10 is executed during the execution of 
Algorithm 4-2 is given by tio{n) = p{n — 2). 

Proof. The variable q points to the smallest non-1 value in d\ . . .dk, and we 
have a complete descending composition in the array each time we reach line 7. 
Therefore, line 10 will be executed once for every descending composition of 
n which contains at least one 2; and it is well known that this is p{n — 2). 
Therefore, tiQ{n) = p{n — 2), as required. □ 

Lemma 4.7. The number of times line 16 is executed during the execution of 
Algorithm ^.2 is given by t\^{n) + t-2,f,(n) = p{n — 2) — 1. 

Proof. The variable q controls the termination of the algorithm. It is initialised 
to 1 on line 2, and the algorithm terminates when 5 = 0. We modify q via 
increment operations on lines 16 and 25, and decrement operations on line 10 
only. Therefore, there must be one more decrement operation than increments 
on q. By Lemma 4.6 there are p{n — 2) decrements performed on q, and there 
must thereforebep(n— 2)— 1 increments. Therefore, ti6{n)+t25{n) = p(n— 2)— 1, 
as required. □ 

Theorem 4.4. Algorithm 4-2 requires RA4.2{n) = 2p{n) — p{n — 2) — 2 read 
operations to generate the set T>{n). 

Proof. Read operations are performed on lines 7 and 12 of Algorithm 4.2. 
Clearly, as all but the composition (n) are visited on line 30, line 7 is exe- 
cuted pin) — 1 times. Then, as a consequence of Lemma 4.6, we know that 
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Algorithm 4.2 AccELDESC(n) 
Require: n > 1 

1: fc ^ 1 

2: 1 

3; d2...dn^ 

4: rfi <— n 

5: visit di 

6: while g ^ do 



7: if = 2 then 

8: k+1 

9: rfq ^ 1 

10: q ^ q - 1 

11: else 

12; m ^ dq — 1 

13: n' <— fc - gr + 1 

14: dq ^ m 

15: while n' > m do 

16: g ^ g + 1 

17: dq ^ m 

18: n' <— n' — m 

19: end while 

20: if n' = then 

21: k = q 

22: else 

23: fc ^ q + 1 

24: if n' > 1 then 

25: 5 <— g + 1 

26: rfq ^ n' 

27: end if 

28: end if 

29: end if 

30: visit d\. . .dk 



31: end while 



line 12 is executed p(n) — p(n — 2) — 1 times. Therefore, the total number of 
read operations is given by RAA.2{ri) = 2p{n) — p{n — 2) — 2, as required. □ 

Theorem 4.5. Algorithm 4-2 requires Wa4.2(") = p{n) + p(n — 2) — 2 write 

operations to generate the set excluding initialisation. 

Proof. After initialisation, write operations are performed on lines 9, 14, 17 
and 26 of Algorithm 4.2. Line 9 contributes p{n — 2) writes by Lemma 4.6; 
and similarly, line 14 is executed p{n) — p{n — 2) — 1 times. By Lemma 4.7 
we know that the total number of write operations incurred by lines 17 and 26 
is p{n — 2) — 1. Therefore, summing these contributions we get WAi.2{n) = 
p{n) +p{n — 2) — 2, as required. □ 
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Theorems 4.4 and 4.5 show that Zoghbi & Stojmenovic's algorithm is a 
vast improvement on RuleDesc. Recall that RuLEDESC(n) requires roughly 
J22=iP(^) ^^^^ and Y^2=iP(^) write operations; and we have seen that Ac- 
CELDESC(n) requires only 2p{n) — p{n — 2) read and p{n) + p{n — 2) write 
operations. 

Zoghbi & Stojmenovic [62] also provided an analysis of AccelDesc, and 
proved that it generates partitions in constant amortised time. Wc briefly sum- 
marise this analysis to provide some perspective on the approach we have taken. 
Zoghbi & Stojmenovic begin their analysis by demonstrating that D{n, m) > 
for all m > 2, where D{n, m) enumerates the descending compositions of 
n in which the initial part is no more than m. They use this result to reason 
that, for each dq>2 encountered, the total number of iterations of the internal 
while loop is < 2c, for some constant c. Thus, since the number of iterations 
of the internal loop is constant whenever dq > 2> (the case for dq = 2 obvi- 
ously requires constant time), the algorithm generates descending compositions 
in constant amortised time. 

The preceding paragraph is not a rigorous argument proving that AccelDesc 
is constant amortised time. It is intended only to illustrate the difference in the 
approach that we have taken in this section to Zoghbi & Stojmenovic's anal- 
ysis, and perhaps highlight some of the advantages of using Kemp's abstract 
model of counting read and write operations [23]. By using Kemp's model we 
were able to ignore irrelevant details regarding the algorithm's implementation, 
and concentrate instead on the algorithm's effect: reading and writing parts in 
compositions. 

4.3 Comparison 

Considering AccelAsc (Algorithm 4.1) first, we derived the following numbers 
of read and write operations required to generate all ascending compositions of 
n, ignoring inconsequential trailing terms. 

RAi.i{n) « p{n) - p{n - 2) and WA4.i{n) « 2p{n) (24) 

We can see that the total number of write operations is 2p{n); i.e., the total 
number of write operations is twice the total number of partitions generated. On 
the other hand, the total number of read operations required is only p{n) —p{n — 
2), which, as we shall see presently, is asymptotically negligible in comparison 
to p{n). The number of read operations is small because we only require one 
read operation per iteration of the outer loop. Once we have stored ak-i in a 
local variable, we can then extend the composition as necessary and visit all 
of the following nonterminal compositions without needing to perform a read 
operation. Thus, it is the write operations that dominate the cost of generation 
with this algorithm and, as we noted earlier, the average cost of a write operation 
in this algorithm is quite small. 

For the descending composition generator, AccelDesc (Algorithm 4.2), the 
following read and write totals were derived (we ignore the insignificant trailing 
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terms in both cases). 

i?A4.2(n) « 2p{n) - p{n - 2) and WA4.2{n) « p{n) + p{n - 2) (25) 

The total number of write operations required by this algorithm to generate 
all partitions of n is p{n) + p{n — 2). Although this value is strictly less than 
the write total for AccelAsc, the difference is not asymptotically significant 
as p{n — 2)/p{n) tends towards 1 as n becomes large. Therefore, we should not 
expect any appreciable difference between the performances of the two algo- 
rithms in terms of the number of write operations involved. There is, however, 
an asymptotically significant difference in the number of read operations per- 
formed by the algorithms. 

The total number of read operations required by AccelDesc is 2p{n) — 
p(n — 2) . This expression is complicated by an algorithmic consideration, where 
it proved to be more efficient to perform p{n) — p{n — 2) extra read operations 
than to save the relevant value in a local variable. Essentially, AccelDesc 
needs to perform one read operation for every iteration of the external loop, to 
determine the value of dq. If dg = 2 we execute the spcxdal case and quickly 
generate the next descending composition; otherwise, we apply the general case. 
We cannot keep the value of dq locally because the value of q changes constantly, 
and so wc do not spend significant periods of time operating on the same array 
indices, as we do in AccelAsc. Thus, we must read the value of dq for every 
transition, and we can therefore simplify by saying that AcCELDESC(n) requires 
p{n) read operations. 

In the interest of the fairest possible comparison between ascending and 
descending compositions generation algorithms, let us therefore simplify, and 
assume that any descending composition generation algorithm utilising the same 
properties as AccelDesc requires p{n) read operations. We know from (24) 
that our ascending composition generation algorithm required only p{n)—p{n — 
2) reads. We can therefore expect that an ascending composition generator will 
require p(n — 2) less read operations than a descending composition generator 
similar to AccelDesc. Other things being equal, we should expect a significant 
difference between the total time required to generate all partitions using an 
ascending composition generation algorithm and a commensurable descending 
composition generator. 

Wc can gain a qualitative idea of the differences involved if we examine the 
average numbers of read and write operations using the asymptotic values of 
p{n). Again, to determine the average number of read and write operations 
required per partition generated we must divide the totals involved by p{n). 
We stated earlier that the value of p{n) — p{n — 2) is asymptotically negligible 
compared to p{n)\ we can quantify this statement using the asymptotic formulas 
for p{n). Knuth [27, p. 11] provides an approximation of p{n — 2)/p{n), which 
can be expressed as follows: 

^ - (26) 
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Using this approximation, we obtain the following estimates for the average 
number of read and write operations required to generate each ascending and 
descending composition of n. 



Reads 



Writes 



Ascending 1 — e 
Descending 1 



2tt/V6'. 



m 



2 

1 + e-^VVe^^ 



Suppose we wished to generate all partitions of 1000. Then, using the best 
known descending composition generation algorithm we would expect to make 
1 read and 1.92 write operations per partition generated. On the other hand, 
if we used AccelAsc, we would expect to make only 0.08 read and 2 write 
operations per partition. 

The qualitative behaviour of AccelAsc and AccelDesc can be seen from 
their read and write tapes (Figure 2). Comparing the write tapes for the algo- 
rithms, we can see that the total number of write operations is roughly equal 
in both algorithms, although they follow an altogether different spatial pattern. 
The read tapes for the algorithms, however, demonstrate the essential difference 
between the algorithms: AccelDesc makes one read operation for every parti- 
tion generated, while the read operations for AccelAsc are sparsely distributed 
across the tape. 

We have derived expressions to count the total number of read and write 
operations required to generate all partitions of n using AccelAsc and Ac- 
celDesc. We can now use these expressions to make some quantitative predic- 
tions about the relative efficiencies of the algorithms. If we assume that the cost 
of read and write operations are equal, we can then derive a prediction for the 
ratio of the total time elapsed using both algorithms. Therefore, let i?4.i(n) be 
the expected total running time of AcCELAsc(n), and similarly define E^^^in) 
for AccELDESC(n). We can then predict that the ratio of the running times 
should be equal to the ratio of their total read and write counts. Thus, using 
the values of (24) and (25), we get 



Consequently, we expect that the total amount of time required to generate 
all ascending compositions of n should be a factor of p{n — 2)/3p(n) less than 
that required to generate all descending compositions of n. To test this hypoth- 
esis we measured the total elapsed time required to generate all partitions of 
n using AccelAsc and AccelDesc, using the methodology outlined in Sec- 
tion 2.3. We report the ratio of these times in Table 2, for both the C and Java 
implementations of the algorithms. 

Table 2 supports our qualitative predictions well. The theoretical analysis of 
ascending and descending composition generation algorithms in this section sug- 
gests that the ascending composition generator should require significantly less 
time to generate all partitions of n than its descending composition counterpart; 
and the data of Table 2 supports this prediction. In the Java implementations. 



i?4.i(n) 3p(ri) — p(n — 2) 
EA.2{n) 3p{n) 



(27) 



32 



• • • 



Read {35) 
AccelAsc(12) 



Write {154) 



p{12) = 77 



(77) Read 



{119) Write 
AccelDesc(12) 



Figure 2: Read and write tapes for the accelerated algorithms to generate as- 
cending and descending compositions. On the loft wo have the read and write 
tapes for the ascending composition generator, Algorithm 4.1; on the right, 
then, are the corresponding tapes for the descending composition generator, 
Algorithm 4.2. In both cases, the traces correspond to the read and write op- 
erations carried out in generating all partitions of 12. 
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Table 2: Empirical analysis of accelerated ascending and descending composition 
generation algorithms. The ratio of the time required to generate all partitions 
of n using AccelAsc and AccelDesc is given: measured ratios for implemen- 
tations in the Java and C languages as well as the theoretically predicted ratio 
are shown. 



n 


p{n) 


Java 


C 


Theoretical 


100 


1.91 X 10^ 


0.85 


0.77 


0.74 


105 


3.42 X 10^ 


0.85 


0.77 


0.74 


110 


6.07 X 10^ 


0.84 


0.75 


0.74 


115 


1.06 X 10^ 


0.84 


0.75 


0.73 


120 


1.84 X 10'' 


0.83 


0.75 


0.73 


125 


3.16 X lO'' 


0.83 


0.74 


0.73 


130 


5.37 X 109 


0.83 


0.74 


0.73 


135 


9.04 X lO'' 


0.82 


0.74 


0.73 






= 0.9891 


;■(. = ().9:-!21 





the ascending composition generator requires 15% less time to generate all par- 
titions of 100 than the descending composition generation algorithm; in the C 
version, the difference is around 23%. These differences increase as the value of 
n increases: when n = 135, we see that AccelAsc requires 18% and 26% less 
time than AccelDesc in the C and Java implementations, respectively. 

We also made a quantitative prediction about the ratio of the time required 
to generate all partitions of n using AccelAsc and AccelDesc. Using the 
theoretical analysis, where we counted the total number of read and write op- 
erations required by these algorithms, we can predict the expected ratio of the 
time required by both algorithms. This ratio is also reported in Table 2, and 
we can see that it is consistent with the measured ratios for the Java and C 
implementations of the algorithms. In the case of the Java implementation, 
the theoretically predicted ratios are too optimistic, suggesting that the model 
of counting only read and write operations is a little overly simplistic in this 
case. The correspondence between the measured and predicted ratios in the C 
implementation is much closer, as we can see from Table 2. In both cases there 
is a strong positive correlation between the predicted and measured ratios. 

5 Conclusion 

In this paper we have systematically compared algorithms to generate all ascend- 
ing and descending compositions, two possible encodings for integer partitions. 
In Section 2 we compared two recursive algorithms: our new ascending compo- 
sition generator, and Ruskey's descending composition generator. By analysing 
these algorithms we were able to show that although both algorithms are con- 
stant amortised time, the descending composition generator requires approxi- 
mately twice as long to generate all partitions of n. In Section 3 we compared 
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two generators in Kemp's idiom: succession rules that require no state to be 
maintained between transitions. We developed a new succession rule for as- 
cending compositions in lexicographic order, and implemented the well known 
succession rule for descending compositions in reverse lexicographic order. The 
analyses of these algorithms showed that the ascending composition generator 
required constant time, on average, to make each transition; whereas the de- 
scending composition generator required 0(\/n) time. Section 4 then compared 
the most efficient known algorithms to generate all ascending and descending 
compositions. We developed a new generation algorithm for the ascending com- 
positions by utilising structure within the set of ascending compositions. We 
also analysed Zoghbi & Stojmenovic's algorithm and compared these two al- 
gorithms theoretically and empirically. As a result of this analysis, we showed 
that the ascending composition generator requires roughly three quarters of the 
time required by the descending composition generator. These three compar- 
isons of algorithms show that ascending compositions are a superior encoding 
for generating all partitions. 

Generation efficiency is not the only advantage of encoding partitions as 
ascending compositions. As part of our analysis of the succession rule for as- 
cending compositions in Section 3 we proved a new formula for computing the 
number of partitions of n in terms of the largest and second largest parts. In 
Section 4.1 we developed a new proof for a combinatorial identity, showing that 
the number of partitions of n where the largest part is less that twice the second 
largest part is equal to the number of partitions of n — 2. These mathematical 
results were motivated by studying algorithms to generate ascending composi- 
tions. 

Another advantage of using ascending compositions to encode partitions, not 
mentioned here, is the possibility of developing algorithms to generate a very 
flexible class of restricted partition. By generalising the algorithms developed in 
this paper it is possible to generate (and enumerate) combinatorially important 
classes of partition such as the partitions into distinct parts [8, §2], Rogers- 
Ramanujan partitions [17] and Gollnitz-Gordon partitions [3]. The framework 
for describing these restrictions and developing generation and enumeration al- 
gorithms is described by Kelleher [22, ch.3-4]. 
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